Model-Based Adaptive Critic Designs
نویسندگان
چکیده
Editor’s Summary: This chapter provides an overview of model-based adaptive critic designs, including background, general algorithms, implementations, and comparisons. The authors begin by introducing the mathematical background of model-reference adaptive critic designs. Various ADP designs such as Heuristic Dynamic Programming (HDP), Dual HDP (DHP), Globalized DHP (GDHP), and Action-Dependent (AD) designs are examined from both a mathematical and implementation standpoint and put into perspective. Pseudocode is provided for many aspects of the algorithms. The chapter concludes with applications and examples. For another overview perspective that focuses more on implementation issues read Chapter 4: Guidance in the Use of Adaptive Critics for Control. Chapter 15 contains a comparison of DHP with back-propagation through time, building a common framework for comparing these methods.
منابع مشابه
Convergence of Critic-based Training
This paper discusses convergence issues when training adaptive critic designs (ACD) to control dynamic systems expressed as Markov sequences. We critically review two published convergence results of critic-based training and propose to shift emphasis towards more practically valuable convergence proofs. We show a possible way to prove convergence of ACD training.
متن کاملAdaptive Critic Learning Techniques for Engine Torque and Air-Fuel Ratio Control
A new approach for engine calibration and control is proposed. In this paper, we present our research results on the implementation of adaptive critic designs for self-learning control of automotive engines. A class of adaptive critic designs that can be classified as (model-free) action-dependent heuristic dynamic programming is used in this research project. The goals of the present learning ...
متن کاملAdaptive critic designs: A case study for neurocontrol
-For the first time, different adaptive critic designs (ACDs), a conventional proportional integral derivative ( PID) regulator and backpropagation of utility are compared for the same control problem---automatic aircraft landing. The original problem proved to contain little challenge since various conventional and neural network techniques had solved it very well. After the problem had been m...
متن کاملBeyond Adaptive Critic - Creative Learning for Intelligent Autonomous Mobile Robots
Intelligent industrial and mobile robots may be considered proven technology in structured environments. Teach programming and supervised learning methods permit solutions to a variety of applications. However, we believe that to extend the operation of these machines to more unstructured environments requires a new learning method. Both unsupervised learning and reinforcement learning are pote...
متن کاملSingle Network Adaptive Critic for Vibration Isolation Control ?
Vibration isolation control is the critical issue to guarantee the performance of various vibration-sensitive instruments and sensors in practical engineering systems. In this paper, single network adaptive critic (SNAC) based controllers are developed for vibration isolation applications. The SNAC approach differs from the typical action-critic dual network structure in adaptive critic designs...
متن کامل